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ABSTRACT Influenza A virions contain eight ribonucleoproteins (RNPs), each comprised of a negative-strand viral RNA, the viral 
polymerase, and multiple nudeoproteins (NPs) that coat the viral RNA. NP oligomerization along the viral RNA is mediated 
largely by a 28-amino-acid tail loop. Influenza viral RNPs, which serve as the templates for viral RNA synthesis in the nuclei of 
infected cells, are not linear but rather are organized in hairpin-like double-helical structures. Here we present results that 
strongly support a coherent model for the assembly of the double-helical influenza virus RNP structure. First, we show that NP 
self-associates much more weakly in the absence of RNA than in its presence, indicating that oligomerization is very limited in 
the cytoplasm. We also show that once NP has oligomerized, it can dissociate in the absence of bound RNA, but only at a very 
slow rate, indicating that the NP scaffold remains intact when viral RNA dissociates from NPs to interact with the polymerase 
during viral RNA synthesis. In addition, we identify a previously unknown NP-NP interface that is likely responsible for organiz- 
ing the double-helical viral RNP structure. This identification stemmed from our observation that NP lacking the oligomeriza- 
tion tail loop forms monomers and dimers. We determined the crystal structure of this NP dimer, which reveals this new NP-NP 
interface. Mutation of residues that disrupt this dimer interface does not affect oligomerization of NPs containing the tail loop 
but does inactivate the ability of NPs containing the tail loop to support viral RNA synthesis in minigenome assays. 

IMPORTANCE Influenza A virus, the causative agent of human pandemics and annual epidemics, contains eight RNA gene seg- 
ments. Each RNA segment assumes the form of a rod-shaped, double-helical ribonucleoprotein (RNP) that contains multiple 
copies of a viral protein, the nudeoprotein (NP), which coats the RNA segment along its entire length. Previous studies showed 
that NP molecules can polymerize via a structural element called the tail loop, but the RNP assembly process is poorly under- 
stood. Here we show that influenza virus RNPs are likely assembled from NP monomers, which polymerize through the tail loop 
only in the presence of viral RNA. Using X-ray crystallography, we identified an additional way that NP molecules interact with 
each other. We hypothesize that this new interaction is responsible for organizing linear, single-stranded influenza virus RNPs 
into double-helical structures. Our results thus provide a coherent model for the assembly of the double-helical influenza virus 
RNP structure. 
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INTRODUCTION 

Influenza A virus, the causative agent of human pandemics and 
annual epidemics, is a negative-sense, single-stranded RNA vi- 
rus in the Orthomyxoviridae family. Within the pleomorphic lipid 
envelope of the virion are eight segments of virion RNA (vRNA). 
Each vRNA segment interacts with multiple nudeoproteins (NPs) 
and a heterotrimeric polymerase complex (3P, comprised of PA, 
PB1, and PB2) to form a viral ribonucleoprotein (vRNP) that 
functions in transcription, replication, and packaging of the viral 
genome ( 1 ) . Approximately 24 nucleotides (nt) of vRNA associate 
with each NP molecule (2, 3). Electron microscopy showed that 
influenza virus RNPs are not linear but rather are hairpin-like 



double-helical structures (2, 4-6). The viral polymerase, which 
binds to the common 3'- and 5 '-terminal sequences of the vRNA 
segments, is located at the hairpin termini (7). The location of the 
viral polymerase at the juxtaposed 3' and 5' termini was also ob- 
served in a reconstituted mini-RNP (8). In contrast to influenza 
virus RNPs, the RNPs of nonsegmented, negative-strand RNA 
viruses (e.g., rhabdoviruses) are linear and form single coils (9). 
The molecular basis for the double-helical structure of influenza 
virus RNPs has not been determined. 

The atomic structures of the NPs of two influenza A virus 
strains, H1N1 (influenza A/WSN/33 [WSN]) and H5N1 (influ- 
enza A/Hong Kong/483/97 [HK]), both in the form of a trimer, 
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have been determined to 3.2- and 3.5-A resolutions, respectively 
(10, 11). These two structures show that NP-NP interaction is 
mediated largely by a tail loop consisting of amino acid residues 
from positions 402 to 429. A highly positively charged groove is 
found at the exterior of the NP trimer, indicating that RNA is 
bound at the outer periphery of the viral RNPs (4, 12). This mode 
of RNA binding is consistent with previous findings that the viral 
RNAs in influenza virus RNPs are readily digested by RNase and 
are exposed to the solvent (12) and that polyvinylsulfate (PVS), a 
negatively charged polymer, is able to completely displace RNA 
from influenza virus RNPs (13). The putative RNA-binding 
groove of the influenza NP is lined with a large number of basic 
residues scattered along the NP primary sequence (10). Mutagen- 
esis analysis identified two polypeptide regions in the groove that 
are essential for RNA binding, one containing residues R74 and 
R75 and the other containing residues R174, R175, and R221 (11, 
14, 15). 

NP that is not associated with RNA is required for viral RNA 
replication, which occurs in two steps (16). First, a full-length 
copy of vRNA, termed complementary RNA or cRNA, is made 
and is then copied to produce vRNA. Whereas viral mRNA syn- 
thesis is initiated with capped RNA primers derived from cellular 
pre-mRNAs, the synthesis of cRNA and vRNA is initiated without 
a primer. Three amino acids (R204, W207, R208) in the loop at the 
top of the head domain of NP are required for both its binding to 
the viral polymerase and its ability to support viral RNA synthesis 
catalyzed by the viral polymerase (17). Several roles for the NP- 
polymerase interaction have been proposed, but no role has been 
definitively established (18-22). NP is deposited along the newly 
synthesized cRNA and vRNA chains during viral RNA synthesis 
(23, 24), so that cRNA and vRNA synthesis and RNP assembly are 
coupled. This coupling leads to the selective binding of NP to 
cRNA and vRNA but not to viral mRNA or cellular RNAs in in- 
fected cells. By itself, NP binds RNA nonselectively in vitro (12). 

It has not been determined how the transition of NP from 
monomer to oligomer is regulated in infected cells. Several obser- 
vations suggested that the oligomeric state of NP might be the 
predominant form of NP. For example, oligomeric ring structures 
of NP were predominantly present in purified, recombinant NP 
that was overexpressed in both Escherichia coli and insect cells ( 10, 
11). In addition, when vRNA was removed from vRNPs, a large 
proportion of the NP molecules still remained associated with 
each other in an RNP-like structure (25, 26). Because of these 
findings, Ruigrok and Baudin (25) raised an intriguing issue about 
RNP assembly: "it will be interesting to know whether NP in the 
infected cells is monomeric and, if so, whether other proteins are 
involved in preventing NP polymerization." In comparison, 
members of the Mononegavirales often encode a phosphoprotein, 
P, that keeps their N proteins in a soluble form prior to RNP 
encapsidation (27), but a comparable viral protein is lacking in 
orthomyxoviruses. 

To resolve this issue and to provide a coherent model for the 
assembly of the double-helical influenza virus RNP structure, we 
performed structural and biochemical characterization of NP in 
three different molecular forms, namely, oligomer, dimer, and 
monomer. In particular, we solved the crystal structure of an NP 
dimer to 2.8-A resolution, which revealed a previously unknown 
NP-NP interface that is likely responsible for organizing the 
double-helical viral RNP structure. Mutational disruption of the 
NP dimer interface resulted in NP molecules that were not able to 



support viral RNA synthesis, as determined in minigenome as- 
says, indicating the biological significance of the NP dimer inter- 
action that organizes influenza virus RNPs into their double- 
helical structure. 

RESULTS 

The NP monomer has only a weak self-association activity in the 
absence of RNA. Because influenza viral RNA synthesis occurs in 
the nucleus (1), newly synthesized NP in the cytoplasm is not 
expected to be associated with vRNAs or cRNAs. In addition, al- 
though NP is a nonspecific RNA-binding protein (28), it should 
not bind to a significant extent to cellular or viral mRNAs, because 
mRNAs are associated with a large array of proteins that compete 
with NP for binding (29). To determine whether NP self- 
associates efficiently in the absence of RNA, freshly purified H 1 N 1 
WSN NP protein at two different concentrations (18 and 300 |tiM) 
was equilibrated for 5 days at 4°C and then analyzed by gel filtra- 
tion (Fig. la). The amounts of trimers and monomers were quan- 
titated, and by assuming the dissociation constant (K d ) for the 
reversible conversion between NP 3 and 3 NPs, i.e., K d is equal to 
[NP] 3 /[NP 3 ],weestimatedai<: iJ of8.5 X 10" 3 M 2 . Such a large K d 
indicates that NP has only a weak self-association activity in the 
absence of RNA. Consequently, NP likely exists primarily as 
monomers in the cytoplasm before it is imported to the nucleus 
and participates in the assembly of viral RNPs. We conclude that it 
is probably unnecessary to invoke the action of another protein to 
prevent NP oligomerization in the cytoplasm. 

Assessing the role of NP oligomerization during viral RNA 
replication. We previously showed that NP-NP interactions are 
largely mediated by a tail loop consisting of amino acid residues 
from 402 to 429 (10). Because cRNA and vRNA synthesis is cou- 
pled with RNP assembly (23, 24), we considered it likely that NP 
proteins lacking a functional tail loop would be inactive in viral 
RNA replication. To determine whether this is the case, we assayed 
such NP mutant proteins for their ability to support viral RNA 
synthesis using minigenome assays and a vRNA-sense luciferase 
reporter, as described in Materials and Methods. As shown in 
Fig. 2a, a WSN NP mutant that lacks the tail loop (A402-429NP), 
as well as NP proteins that contain the R416A or E339A mutation, 
which disrupts a critical intersubunit salt bridge, have completely 
lost the ability to support viral RNA synthesis. These results are 
consistent with results from earlier mutagenesis analysis (30, 31). 
In contrast, the ability of NP to interact with polymerase proteins, 
as assayed in cotransfection assays, was not affected by such mu- 
tations (Fig. 2b). 

During viral RNA synthesis, a viral template RNA (vRNA or 
cRNA) has to dissociate from several, if not many, NP molecules 
in order to interact with the large trimeric polymerase. The NP 
scaffold in the RNA-free region remained intact only if NP oli- 
gomers are kinetically stable. To measure the stability of RNA-free 
NP oligomers, purified recombinant NPs lacking RNA, which are 
in the form of oligomers (i.e., mostly trimers), were incubated at 
37°C. These NP oligomers broke down to monomers in solution 
very slowly. For the NP of H1N1 WSN virus, gel filtration analysis 
of purified NP trimers showed that they dissociated to monomers 
via an approximate first-order reaction with a half-life (f 1/2 ) of 
=T0 h (Fig. lb). A large t l/2 was also found for the dissociation of 
the NP trimers of other influenza A virus strains, namely, an 
H3N2 virus (influenza A/Udorn/72 virus), Hong Kong (HK)/ 
H5N1 virus, and an H9N2 virus (influenza A/HK/1074/99). Con- 
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FIG 1 Oligomerization behavior of NP. (a) NP oligomerization at equilibrium. Freshly purified NP was diluted to different concentrations and allowed to 
equilibrate for 5 days at 4°C before being injected into a Superdex 200 size exclusion column. For NP samples at 16.5 mg/ml (300 /j,M) and 1 mg/ml (18 (iM), the 
proportions of trimers/monomers were estimated to be 70%/30% and 10%/90%, respectively, by integrating the areas under the two well-separated peaks in the 
chromatogram. These measurements gave rise to two independently determined K d s: 10 X 10~ 3 M 2 and 7.1 X 10~ 3 M 2 , assuming monomer-^trimer transition. 
The average of the two yielded the final K d of 8.5 X 10~ 3 M 2 . (b) Time-dependent dissociation behavior of the NP trimer (A/WSN/1933). Purified NP trimers, 
at a 1-mg/ml or 18 juM concentration, prepared in the storage buffer (50 mM Tris-HCl, pH 7.5, and 200 mM NaCl) were applied to a Superdex 200 size exclusion 
column to assess their size distribution at different time points (marked with different colors). A measurement taken at 72 h produced a curve that was nearly 
identical to the one at 40 h and therefore was omitted for clarity. 



sequently, these NP-NP interactions have extremely low dissoci- 
ation rates, indicating that the NP scaffold in the RNP complex 
likely remains intact during viral RNA synthesis, when template 
RNA has to dissociate from some NPs in order to interact with the 
polymerase. Consistent with this conclusion, RNPs have been 
found to retain their rod-like structure even after RNA was 
stripped away (25, 26). 

NP without the tail loop forms dimers via a different NP-NP 
interface. When analyzed by gel filtration, the A402-429NP mu- 
tant of WSN virus, as expected, no longer forms homo-trimers or 
larger homo-oligomers (Fig. 2c), as is also the case for the R416A 
or E339A mutant NP protein (data not shown). Unexpectedly, the 
NP tail loop mutant not only forms monomers but also forms a 
species at the ~110-kDa position (Fig. 2c), which corresponds to 
the molecular mass of an NP dimer containing two ~55-kDa NP 
monomers. This dimer species was also observed with the tail loop 
mutant of HK H5N1 NP (Fig. 2c). This formation of homo- 
dimers indicates that NP is able to dimerize presumably through a 
new interface that does not involve the tail loop. In fact, it appears 
that native NP is also capable of forming dimers. As shown in 
Fig. lb, a small population of dimer species was clearly visible in 
the native NP sample after most of the NP trimers had dissociated 
to monomers (Fig. lb). 

Crystal structures of the NP dimers. To identify the new 
dimer interface, we determined the crystal structure of the WSN 
A402-429NP dimer to 2.8-A resolution (Table 1; Fig. 3). We also 
attempted to crystallize the A402-429NP monomer but without 
success. In the NP dimer crystal, a dimer is found in each asym- 
metric unit, and it appears to be the only stable dimer based on 
buried surfaces (-2,100 A 2 ). As is the case in the NP trimer, each 
NP subunit in the dimer adopts a crescent-like shape with a head 
and a body domain. The two NP molecules in the dimer pack 
against each other in an antiparallel fashion, with the putative 
RNA-binding groove fully exposed in both subunits (Fig. 3a). The 
two NP subunits are essentially identical and can be superimposed 



with a root mean square difference (RMSD) of 1.1 A for 401 com- 
mon C-a atoms. By comparison, superimposing subunit B of the 
A402- 429NP dimer with B subunit of the WSN NP trimer yielded 
an RMSD of 1.4 A for 385 common C-a atoms, suggesting that the 
removal of the tail loop did not cause a large overall conforma- 
tional change in the tertiary structure of NP in the dimer form. 
Indeed, when the RNA-binding activity was measured using a 
20-nt RNA oligonucleotide, the A402-429NP dimer exhibited a 
binding affinity that was indistinguishable from that of the wild- 
type NP trimer (K d was equal to 3.9 ± 0.2 nM for the A491- 
498NP dimer versus 3.6 ± 0.5 nM for the wild type). 

Because of its higher resolution (i.e., 2.8 A), the crystal struc- 
ture of A402-429NP reveals more-ordered regions than the two 
previously published influenza A virus NP structures. Two of the 
newly defined regions are residues 73 to 91 and 490 to 498, both of 
which are disordered in the original trimeric H1N1 NP structure 
(Fig. 3b). Residues 73 to 91 emanate from the body domain and 
form the so-called loop LI, which folds back into the putative 
RNA-binding groove and interacts with loop L2 (Fig. 3b). The 
other two loops in and around the RNA-binding groove are L3 
and L4, numbered sequentially based on their primary sequence. 
Loops LI and L2 have been found to play a critical role in NP 
RNA binding (11). In particular, loop LI has the sequence 
73 ERRNKYLE EHPSAGKDPK K 91 , with 6 basic amino acid resi- 
dues (boldface), and deletion of residues 74 to 88 resulted in a 
6-fold decrease in the RNA-binding activity of the HK H5N1 NP 
(11). Loop L3 does not directly participate in RNA binding but is 
found to mediate the NP-polymerase interaction that is required 
for viral RNA synthesis (17). 

In both subunits of the A402-429NP dimer, the C-terminal 
tail, which contains amino acids 490 to 498, extends from the back 
of the body domain and winds into the putative RNA-binding 
groove at the front of the NP molecule (Fig. 3c). This C-terminal 
tail contains four acidic amino acid residues (D491, E494, E495, 
and D497) and two asparagines (N492 and N498), and therefore 
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FIG 2 NP tail loop mutants, (a) Viral RNA synthesis activities of NP mutants determined in the minigenome assay. A schematic diagram shows the tail loop 
interaction for the wild-type NP trimer and the residues mutated in the three mutants, (b) NP interaction with the PB1 and PB2 proteins. In the pulldown 
experiment, wild-type NP (wtNP) or A402-429NP, both containing a His tag at the C terminus, was coexpressed with untagged PB1 or PB2, respectively. 
Anti-PBl, anti-PB2, and anti-His antibodies were used for detection, (c) Oligomerization behavior of NP tail loop mutants. Wild- type NP and tail loop-deleted 
NP (A402-429NP) from both A/WSN/1933 (WSN) and A/Hong Kong/483/97 (HK97) viruses were freshly purified and applied to a Superdex 200 size exclusion 
column. Peak positions of protein standards are indicated by arrows. NP monomer, dimer, trimer, and higher-order oligomers are indicated by arrowheads. 



its electrostatic property is opposite to that of the positively 
charged surface of the putative RNA-binding groove. The acidic 
nature of the C-terminal tail is conserved across all members of the 
Orthomyxoviridae family. In one extreme case, the infectious 
salmon anemia virus (ISAV), an isavirus that is distantly related to 
the influenza A virus, has 12 acidic residues in the last 16 residues 
of its NP sequence (32). 

Comparison of the trimeric and dimeric NP interaction inter- 
faces shows that two different sets of residues are used for these 
two types of interactions (Fig. 4). The tail loop interaction is re- 
sponsible for NP oligomerization and involves residues in and 
around the tail pocket at the back of the NP molecule (Fig. 4b). In 
contrast, dimer interactions are mediated mostly by two polypep- 
tide regions: a helix-turn-helix motif comprised of residues 149 to 
167 and the C terminus from residues 482 to the end (Fig. 4a). 



Near the 2-fold symmetry axis, the side chains of residues 487 to 
489 close to the C-terminal tail are rearranged compared to those 
of the NP trimer. These movements allow the side chains of F489 
from the two subunits to stack against each other. Overall, the 
intersubunit interface buried in the trimer is more extensive 
(-8,331 A 2 ) than that in the A402-429NP dimer (-2,122 A 2 ). 

The NP dimer interface is required for viral RNA replication. 
To verify that the dimer interface is biologically relevant, we tried 
to identify amino acid residues that are critical for NP dimeriza- 
tion but do not affect oligomerization or RNA binding. We first 
determined which mutation in the NP mutant with the loop de- 
leted eliminated dimer formation. After testing various muta- 
tions, we showed that introducing a D491A mutation eliminated 
dimer formation (Fig. 5a and b). Although D491 is not on the 
dimer interface, its mutation apparently disturbs the conforma- 
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TABLE 1 Data collection and refinement statistics 



Parameter 


Valuefs)" 


Data collection 


Wavelength (A) 


0.9179 


Range of resolution (A) 


30-2.8 


Space group 


C2 


Cell dimensions (A) 


a = 60.2, b = 155.6, 




c = 97.8, (3 = 90.9° 


Mosaicity (°) 


0.83 


No. of unique reflections 


21,284 


Redundancy 


4 


Mean Ila(I) 


9.1 (2.5) 


Rmerge (%) ^^^^^^^^ 


11.5 (43.9) 


Completeness (%) 


97.6 (82.6) 




Refinement statistics 


Rwork/« fr ee (%) 


24.2/29.8 


RMS bond length (A)/angle (°) 


0.010/1.568 


Avg resolution of B factors (A 2 ) 


56.7 


Ramachandran plot data 


Most favored regions (%) 


83.4 


Additional allowed regions (%) 


16.6 


Generously allowed regions (%) 


0 


Disallowed regions (%) 


0 



a Data in parentheses are for the highest-resolution shell. 



tion of the C-terminal tail, eliminating dimer formation. It was 
also confirmed that the D491A mutation did not have any effect 
on regular NP oligomerization/trimerization through the tail loop 
(Fig. 5b). In addition to identifying D491A, we identified two 
other mutations, F487D and F487R, that were able to affect dimer 
interaction but not trimer formation (Fig. 5a and data not shown) . 
Of the other dimer mutations that we have made, R152A, D160A, 
R261A, and F489A result in partial but not complete disruption of 
NP dimer formation. The A49 1-498 mutation completely dis- 
rupted NP dimer formation, but considering the extensive nature 
of the deletion, functional characterization of this mutant was not 
further pursued. 

The ability of these three single-residue NP dimerization mu- 
tants to support viral RNA synthesis was then tested in minig- 
enome assays using both a vRNA-sense luciferase template and a 
cRNA-sense luciferase template (Fig. 5c and d). The Y487R NP 
mutant did not show any activity in either assay. Although the 
Y487D and D491A mutants showed approximately 10 to 20% 
activity in the vRNA luciferase assay, we did not detect any mRNA 
synthesis real time-PCR (RT-PCR). These two mutants showed 
similarly low activities in the cRNA assay, but, as shown, no 
vRNA-sense RNA was seen with 25 cycles of RT-PCR. Likewise, 
no band was observed with 30 cycles either. Additionally, we did 
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FIG 3 Structure of the A402-429NP dimer. (a) A402-429NP dimer. The molecule is viewed along the noncrystallographic 2-fold symmetry axis. The two 
subunits are shown in red and green. Arrows point to the potential RNA-binding grooves. Structurally disordered regions in each molecule are noted in the main 
text, (b) Subunit B of the A402-429NP dimer. The polypeptide chain is rainbow colored from blue to red continuously from the N to the C terminus. The four 
loops (LI to L4) in/near the potential RNA-binding groove are highlighted. Important residues from loops LI to L3 are shown as sticks and highlighted in 
magenta, (c) C-terminal tail of A402-429NP. Residues 490 to 498 are shown as sticks, and the rest of the molecule is a surface representation colored according 
to electrostatic potential, with positive in blue and negative in red. The viewing direction is similar to that in panel b. 
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Dimer interface Trimer interface 

FIG 4 A402-429NP dimer interface, (a) Subunit B of the NP dimer in ribbons (left) and as a surface representation (right). Residues within a 3.6-A distance 
of the other subunit of the dimer are highlighted in red. The viewing direction is from the back of the NP molecule, such that the RNA-binding groove is located 
on the other side of the molecule, (b) Subunit C of the NP trimer (A/WSN/33). Surface regions within a 3.6-A distance of the tail loop of subunit A are shown in 
red. Molecules in both panels are shown in the same orientation. 



not detect smaller RNAs with the three dimerization mutants. Our 
results thus demonstrate the biological importance of the NP 
dimer interaction. 

DISCUSSION 

Biological role of the NP tail loop and the dimer interface. We 

have identified three different molecular forms of NP, namely, 
trimers, dimers, and monomers. The NP molecules that form 
trimers oligomerize by interacting with each other largely via the 
tail loop and contain two structural domains, as previously de- 
scribed (10,11). The NP dimer lacking the tail loop consists of two 
NP subunits that pack against each other side-by-side in an an- 
tiparallel fashion through mostly hydrophobic interactions and 
cannot further polymerize. The NP monomer, which likely has a 
structure similar to that of trimeric NP, is in dynamic equilibrium 
with NP oligomers, but these NP-NP interactions are weak in the 
absence of any associated RNA. 

NP residues that mediate dimer formation are mostly different 
from those used for tail loop interactions, suggesting that these 
two types of interactions are nonexclusive and can occur simulta- 
neously within a higher-order complex. The NP dimer crystal 
structure shows that two polypeptide regions constitute most of 
the dimer interface: the helix-turn-helix motif comprised of resi- 
dues 149 to 167 and the C-terminal tail from residue 482 to the 
end. The critical role of the C-terminal tail of NP in NP dimer 
formation is evident from our mutagenesis data, as the disruption 
of the structural conformation of the C-terminal tail alone re- 
sulted in NP mutants that were not able to form any dimers. In- 
terestingly, residues 490 to 498 are disordered in the original NP 
trimer crystal structure, and as expected, the deletion of residues 
491 to 498 had no effect on NP's ability to form trimers and larger 
oligomers (data not shown). In addition to mediating dimer in- 
teraction, the C-terminal tail of NP may help to regulate NP RNA- 
binding activity and adopt different conformations in different 
functional states (e.g., monomeric versus oligomeric NP, RNA- 
bound versus RNA- free NP). We found that the RNA-binding 
activity of the A491-498 mutant was enhanced by ~2-fold com- 
pared to that of the wild-type NP protein (K d was equal to 1.7 ± 
0. 1 nM for the A491-498 mutant versus 3.6 ± 0.5 nM for the wild 
type). 



While the tail loop functions in maintaining interactions be- 
tween adjacent NP molecules in influenza virus RNPs, the NP 
dimer interaction has the potential to organize the RNP into its 
double-helical structure (Fig. 6). In any double-helical structures, 
two sets of stabilizing forces are required. The first set is between 
adjacent molecules on the same polymer strand, and the second is 
between molecules from the two opposing polymer strands of the 
double-helical structure. As shown in Fig. 6, our double-helical 
RNP model entails two types of NP interactions. One is the tail 
loop interaction, which is used to maintain connection between 
adjacent NP molecules that point in the same direction and bind 
to the same RNA strand. The other is the dimer interaction made 
between antiparallel NP molecules that are associated with two 
antiparallel RNA strands. Disruption of the NP dimer interface 
leads to the unwinding of the RNP, resulting in the dissociation of 
the two antiparallel RNA strands, as previously observed under 
high-salt and low-salt buffer conditions (7). Because the RNA- 
binding groove is located at the exterior of the NP dimer structure, 
bound RNAs are likely to be solvent exposed in the influenza virus 
RNPs. Since NP forms the internal protein scaffold of RNP based 
on our model, the overall structure of RNP is expected to remain 
intact after bound RNA is dissociated, consistent with an earlier 
observation (25). We speculate that the NPs of nonsegmented, 
negative-strand RNA viruses lack such a dimer interface, explain- 
ing why the RNPs of these viruses are linear rather than double- 
helical. Interestingly, the RNPs of another group of segmented, 
negative-strand RNA viruses, bunyaviruses, also appear to be 
double-helical (33), suggesting that a dimer interface similar to 
that described here for the influenza virus NP may be shared by 
other members of this group of RNA viruses. 

We have shown that NP mutants that cannot dimerize also 
cannot support viral RNA synthesis, as determined in minig- 
enome assays. In these assays, the vRNAs and cRNAs that are 
produced are presumably in the form of RNPs, as is the case in 
infected cells. These assays measure the steady-state levels of 
vRNPs or complementary RNPs (cRNPs) that are produced. Con- 
sequently, the absence of newly synthesized vRNPs and cRNPs are 
consistent with two possibilities, which are not mutually exclu- 
sive: (i) elongation of vRNA and cRNA chains stops at the point at 
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FIG 5 The NP dimer interface is important for viral RNA replication, (a) Residues D491 and Y487 in the NP dimer and trimer. Different subunits in the NP 
dimer and trimer are shown in different colors. C-terminal tail residues 487 to 498 are highlighted in red. In the NP trimer, residues after 489 are disordered, and 
therefore D491 is missing from the structure, (b) Gel filtration chromatogram for D491 mutants. NP dimers were not observed when D491 was introduced into 
the A402-429 loop mutant, (c) RNA synthesis activities of NP mutants determined in minigenome assays using a vRNA-sense template. The amount of NP 
expressed in transfected cells is shown at the bottom, (d) RNA synthesis activities of NP mutants determined in minigenome assays using a cRNA-sense template. 
The amounts of NP proteins expressed in transfected cells are shown at the bottom. The amounts of vRNA detected by 25 cycles of RT-PCR are also shown. IB, 
immunoblot. 



which the normal double-helical region of the RNPs is reached, 
and the resulting small single-stranded RNPs are unstable, and/or 
(ii) partial or complete single-stranded RNPs are produced but are 
inherently unstable or cannot be used as the template for further 
RNA synthesis. In any case, these results indicate that the ability to 
form the NP-NP interface needed for the formation of a double- 
helical structure is biologically important. 



NP trafficking and functions in infected cells. Our results 
support the model for NP trafficking and functions in infected 
cells outlined in Fig. 6b. Newly synthesized NP exists as monomers 
in the cytoplasm due to the weak NP-NP association in the ab- 
sence of bound RNA. After importation into the nucleus, NP mol- 
ecules are added to the growing end of the assembling RNP com- 
plex, where monomeric NP polymerizes with RNA-bound NP 
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FIG 6 Models for RNP (ribonucleoprotein) assembly and function, (a) Sta- 
bilizing forces in a double-helical RNP. Cyan spheres represent NP molecules, 
and blue ovals represent the viral polymerase complex. The RNA molecule is 
shown by the green curve, (b) RNP assembly and function. Monomeric and 
polymeric forms of NP are shown by irregularly shaped and circular cyan 
objects, respectively. The green curves represent vRNA or cRNA. 



through the tail loop interaction. Fully assembled RNP adopts a 
double-helical structure, mediated by the dimer interface revealed 
in the A402-429NP dimer structure described here. 

During viral RNA transcription and replication, it is necessary 
for vRNA (and cRNA) to dissociate from at least part of the NP 
scaffold to access the polymerase active site in order to function as 
a template. This dissociation does not require the disruption of the 
NP scaffold because the vRNA is exposed on the outer surface of 
the RNP. Our results suggest that the structure of RNP is main- 
tained primarily by NP protein-protein contacts (i.e., the tail loop 
between adjacent NP molecules and the dimer interaction be- 
tween strands). Because we have shown that NP-NP interaction 
has a low dissociation rate constant, the NP scaffold is likely to 
remain intact when the vRNA template locally dissociates from 
the RNP during RNA synthesis, consistent with a previous obser- 
vation that the overall structure of RNP is maintained even after 
vRNA is removed (25). After a vRNA region is copied by the poly- 
merase, it can then readily reassociate with the NP scaffold. The 
vRNP and cRNP thus maintain their structural stability during the 
course of infection once they are properly assembled. 

MATERIALS AND METHODS 

Plasmid constructs and protein purification. NP genes from A/WSN/ 
1933 (H1N1) and A/HK/483/97 (H5N1) were cloned into pET28a( + ) 
vectors and expressed in E. coli Rosetta 2(DE3) Singles competent cells 
(Novagen) (10). A C-terminal 6XHis tag was engineered to facilitate pu- 
rification by affinity chromatography. Nontagged NPs from both the 
H1N1 and H5N1 viruses were also expressed for control purposes. NP 
mutants were constructed using standard oligonucleotide mutagenesis 
methods and were expressed using the same system. E. coli cells 
were grown in Luria broth medium at 37°C to an optical density at 
590 nm (OD 590 ) of 0.6 and induced by 1 mM isopropyl-j3-D- 
thiogalactopyranoside (IPTG) for 6 h at 28°C. 



For insect cell expression, constructs of untagged and C-terminally 
His-tagged NPs and their mutants were cloned into the baculovirus vector 
pFastBac and expressed in Sf21 insect cells using the Bac-to-Bac baculo- 
virus expression system (Invitrogen). Baculoviruses expressing the three 
polymerase protein subunits were constructed, coexpressed, and purified 
as previously described (19). Sf21 insect cells were grown in Hink's 
TNM-FH insect medium (JRH Biosciences) supplemented with 10% 
(vol/vol) fetal bovine serum (Valley Biomedical) and harvested 48 h 
postinfection. 

Purification of NP, overexpressed in either E. coli or insect cells, was 
performed as previously described (10). Briefly, clarified cell lysate was 
initially subjected to a Ni-nitrilotriacetic acid (NTA) column (for His- 
tagged proteins) or to an ammonium sulfate precipitation step (for 
nontagged proteins) . Fractions containing NP were subsequently purified 
by heparin affinity and gel filtration chromatography. The final yield was 
about 1.5 mg of NP dimer and 5 mg of NP monomer per liter of cells. The 
purified protein (-95% pure) was concentrated to 5 mg/ml using a Cen- 
tricon tube (Millipore) and stored at -80°C. 

Crystallization and structure determination. Crystallization condi- 
tions were initially screened using a Hydra II Plus One microdispenser 
robot (Apogent Discoveries) with ready-to-use crystal screen kits (Hamp- 
ton Research, Qiagen). Rectangular crystals (~ 10 by 20 by 80 ^m 3 ) of the 
A402-429NP dimer were grown at room temperature by vapor diffusion 
with hanging drops with a ratio of protein to well solution of 1 : 1 . The well 
solution contained 100 mM Tris-HCl, pH 8.5, 18% polyethylene glycol 
3350 (PEG 3350), 10% glycerol, and 10 mM dithiothreitol (DTT). The 
A402-429NP dimer crystallized as small rectangles. Crystals were trans- 
ferred to stabilizing solution (100 mM Tris-HCl, pH 8.5, 20% PEG 3500, 
10% glycerol, 20% PEG 400) and flash frozen in liquid nitrogen. Data 
were collected at the Cornell High Energy Synchrotron Source (CHESS) 
with beamline Fl. All data were processed with HKL-2000 (Table 1) (34). 

The structure of the A402-429NP dimer was solved by molecular 
replacement to 2.8-A resolution using Phaser (35) and CCP4 (36). The 
A/WSN/33 (H1N1) NP trimer structure (10) was used as the initial phas- 
ing model, and it was found that there were two molecules per crystallo- 
graphic asymmetric unit. The initial map was built using program O (37). 
To reduce model bias, composite omit maps and density modification 
were used, and the model was subjected to multiple cycles of simulated 
annealing with the Crystallography &. NMR System (CNS) (38). The two 
noncrystallographic symmetry (NCS)-related NP molecules are indepen- 
dently refined without any restraints/constraints. After position and ther- 
mal parameter refinement, the R work and R free converged to 0.2415 and 
0.2978, respectively. In the final model, subunit B contains 424 out of 
470 amino acids (excluding the 28 residues in the tail loop), including 
residues 2 1 to 389, 438 to 450, and 460 to 498; subunit A is less ordered and 
contains 18 fewer residues. Specifically, residues 390 to 392 are ordered in 
subunit A, but residues 83 to 89, 202 to 212, and 451 to 453 are disordered. 
The Ramachandran plot showed that 83.4 and 16.6% of residues were 
located in the most favored regions and additional allowed regions, re- 
spectively. 

FP. Fluorescence polarization (FP) was measured using a PanVera/ 
Beacon 2000 FP system at room temperature. Samples were excited at 
490 nm, and emission was measured at 520 nm. The reaction buffer used 
for all experiments contained 20 mM Tris, pH 7.5, 200 mM NaCl, 5 mM 
2-mercaptoethanol, and 1 mM EDTA. A 5'-fluoresceinated 20-mer RNA 
oligonucleotide (5' AGUAGAACAGGGUGACAAAG 3' with the con- 
served 5' vRNA sequence) ( 12) was chemically synthesized (Dharmacon). 
For each experiment, protein was serially titrated into a 0.99-ml reaction 
buffer containing 0.2 nM fluoresceinated oligonucleotide until the reac- 
tion reached saturation. Each protein titration was repeated in triplicate. 
The data were fitted to a sigmoidal curve using Igor Pro (WaveMetrics, 
Portland, OR). 

His-tagged pulldown. C-terminally 6XHis-tagged wild-type NP and 
A402-429NP were coexpressed with untagged PB1 and PB2, respectively, 
using the Bac-to-Bac baculovirus expression system (Invitrogen). After 
sonication and centrifugation, the cell supernatant was incubated with 
Ni-NTA resin (Qiagen), washed, and eluted as previously described. Pro- 
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teins were analyzed by 9% SDS-PAGE and Western blotting. Anti-PBl, 
anti-PB2, and anti-tetra-His (Qiagen) were used as primary antibodies, 
depending on which proteins were coexpressed. Goat anti-mouse IgG and 
anti-rabbit IgG conjugated with alkaline phosphate (Pierce) were used as 
secondary antibodies. The blot was developed with SigmaFAST 5-bromo- 
4-chloro-3-indolylphosphate (BCIP)-nitroblue tetrazolium (NBT). 

Minigenome assays. Dual-luciferase reporter assays for viral RNA 
synthesis were carried out as previously described (17). The vRNA tem- 
plate was provided by a pHH2 1 plasmid expressing an RNA containing 
the 26 5'- and 3'-terminal bases of the Ud NS vRNA sandwiching a 
negative-sense firefly luciferase reading frame. Alternatively, a cRNA tem- 
plate was provided by a pHH21 plasmid expressing the 5'- and 3'- 
terminal bases of Ud NS cRNA sandwiching a positive-sense firefly lu- 
ciferase reading frame. To monitor transfection efficiencies, a polymerase 
II-driven plasmid expressing Renilla luciferase was also transfected. The 
firefly/ Renilla luciferase ratio measures the level of viral RNA synthesis. 
The amount of viral RNA produced (cRNA or vRNA) in these assays was 
also directly measured by semiquantitative RT-PCR. 

Protein structure accession number. Crystal structure coordinates 
have been deposited in the Protein Data Bank (PDB; identifier, 3MIR). All 
ribbon diagrams were made using the program PyMOL (W. L. Delano; 
http://www.pymol.org). 
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